# Document image understanding
Paligemma Rich Captions
Apache-2.0
An image caption generation model fine-tuned on the DocCI dataset based on PaliGemma-3b, capable of generating detailed descriptions of 200-350 characters with reduced hallucination
Image-to-Text
Transformers English

P
gokaygokay
66
9
Donut Base Finetuned Latvian Receipts V2
MIT
A model based on the Donut architecture, specifically fine-tuned for Latvian receipt data
Text Recognition
Transformers

D
Inesence
13
0
Donut Base Finetuned Latvian Receipts
MIT
This model is a fine-tuned version of donut-base on a Latvian receipt dataset, primarily used for receipt image processing tasks
Text Recognition
Transformers

D
Inesence
31
0
Donut Base Payslips
MIT
Document understanding model based on Donut architecture, specifically fine-tuned for payslip image processing
Text Recognition
Transformers

D
Assadullah
20
0
Featured Recommended AI Models